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Abstract. Recently, multi-scale notions of local homology (a variant 
of persistent homology) have been used to study the local structure of 
spaces around a given point from a point cloud sample. Current recon- 
struction guarantees rely on constructing embedded complexes which 
become difficult in high dimensions. We show that the persistence dia- 
grams used for estimating local homology, can be approximated using 
families of Vietoris-Rips complexes, whose simple constructions are ro- 
bust in any dimension. To the best of our knowledge, our results, for the 
first time, make applications based on local homology, such as stratifica- 
tion learning, feasible in high dimensions. 

1 Introduction 

Advances in scientific and computational experiments have improved our ability 
to gather large collections of data points in high-dimensional spaces. One aspect 
in topological data analysis is to infer the topological structure of a space given a 
point cloud sample. We often assume the space has manifold structure, however, 
more interesting cases arise when we relax our assumptions to include spaces that 
contains singularities and mixed dimensionality, for example, stratified spaces. 

Stratified spaces can be decomposed into manifold pieces that are glued to- 
gether in some uniform way. An important tool in studying these spaces is the 
study of the neighborhoods surrounding singularities, where manifolds of differ- 
ent dimensionality intersect. We focus on sampling conditions for such neighbor- 
hoods, which allow us to begin examining how difficult certain reconstruction 
techniques are with respect to the geometric properties of the underlying shape. 
Our main task is to infer sampling conditions suitable for recovering local struc- 
tures of stratified spaces, in particular, the local homology groups, from a possibly 
noisy sampled point set. 

Stratification learning. In stratification learning (or mixed manifold learning), 
a point cloud is assumed to be sampled from a mixture of (possibly intersect- 
ing) manifolds. The objective is to recover the different pieces, often treated 
as clusters, of the data associated with different manifolds of varying dimen- 
sions. Stratified spaces has been studied extensively in mathematics, see seminal 
work in |1I2) . Recently, topological data analysis, relying heavily on ingredients 
from computational topology [314] and intersection homology |5I6I7| has gained 
momentum in stratification learning. In particular, the work in [H] focuses on 
studying the local structure of a sampled stratified spaces based on a multi-scale 
notion of local homology (see Section [2]) . More recent work in [9] studies how 



point cloud data could be clustered by strata based on how the local homology 
of nearby sampled points map into one another. 

Reconstruction and sampling. Reconstructing shapes from potential noisy 
point cloud samples has been studied in many fields. Most often the work is 
heavily tied to a reconstruction criteria (e.g. homotopic, homeomorphic, etc.) 
and the assumptions on the underlying space (e.g. manifold). Combinatorial al- 
gorithms in geometry are generally derived from Dclaunay triangulations |10) 
and alpha shapes [TT], and provide correctness proofs associated with such re- 
constructions [121. As the dimension increases, reconstruction efforts have been 
redirected towards alternative combinatorial structures such as tangential De- 
launay complexes [13] . witness complexes [2], Cech complexes and the closely 
related Vietoris-Rips complexes [15116117) . 

However, these existing techniques are primarily concerned with global re- 
construction. Providing reconstruction guarantees for local structures is more 
challenging. To guarantee theoretical correctness in computing persistence local 
homology, both [5] and [5] use Delaunay complexes and their variants. However 
constructing Delaunay complexes in high dimensions is known to be difficult 
due to scaling and numerical issues with predicates. On the other hand, meth- 
ods for fast [18] and efficient |16|19j constructions of Vietoris-Rips complexes are 
available, and there have been theoretical advances on their topology-preserving 
qualities, making it appealing for computations in high dimensions. The goal 
of this paper is to make persistent local homology computation more practical 
through approximations. 

Contributions. Our contributions focus on providing sampling conditions to 
recover the local structure of a space from a point cloud sample, based on previ- 
ously introduced [5] multi-scale notions of local homology. Our main results are: 

— We extend previously introduced algebraic constructions in the analysis of 
scalar fields over point cloud data 20J to two multi-scale notions of local 
homology. 

— For both multi-scale notions of local homology, we approximate their persis- 
tence diagrams by constructing families of Vietoris-Rips complexes based on 
a set of sample points, formalized within Theorem [2] and [3] The simplicity 
and efficiency of building the these complexes in any dimension makes, for 
the first time, applications based on local homology such as stratification 
learning feasible in high dimensions. 

— We show that relative persistent modules are interleaved if the respective 
absolute persistent modules are interleaved. We consider such a technical 
result (Theorem [I]) of independent interest. 

— Our results imply algorithms for computing the local homology either by a 
reduction to standard persistence or a known variant. 

2 Background 

The background material focuses on the introduction of persistence modules |21j , 
local homology and its multi-scale notions [8|. We assume a basic knowledge of 
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Fig. 1: Strongly e-interleaved persistence modules. 



homology and persistent homology, see [22J23j for a readable background of the 
former, and [23] for a computational treatment of the latter. 
Persistence Modules. We use the definition of persistence modules adapted 
from |21j. A persistence module T = {-FoJaeR is a collection of vector spaces F a 
(over any fields) together with a family {/f : F a —> Fp} a <p of linear maps such 
that a < P < 7 implies f2 = fj ° /«, and f° = idp a . A persistence module 
is tame if it has finite number of critical values and all F a are of finite rank. 
Unless otherwise specified, we suppose all persistence modules we encounter in 
this paper are tame. 

Two persistence modules {F} a and {G} a are (strongly) e-interleaved if there 
exists two families of homomorphisms, /i Q : F a — > G a + £ and v a : G a — > F a+e , 
that make the following diagrams (Fig. [I]) commute for all a < f3 € R [2T]. 
The information contained in a persistence module can be encoded by a multi- 
set of points in the extended plane M 2 (where 1 = MU {— oo,oo}), called a 
persistence diagram [5S]. If two tame persistence modules are e-interleaved, the 
bottleneck distance between their persistence diagrams are upper bounded by e 
([25;, Theorem 4.4). In this paper, we consider persistence modules of homology 
groups and relative homology groups over a field. Given a family of topological 
spaces {Xq,} q connected by inclusions X Q X^, the inclusions induce a se- 
quence of homology groups connected by homomorphisms, Hfc(X a ) — > Hfc(X^), 
where k is the homological dimension. We therefore obtain persistence modules 
of the form {Hfc(X Q ,)} ct . Specifically, when the linear maps associated with two 
persistence modules {Hfc(X Q )} Q and {Hfc(Y Q )} Q are induced by inclusions at the 
space level X Q Y a+E and Y a c -t X Q+e , their fe-th persistence modules are 
e-interleaved [21]. For the rest of the paper, we sometimes abuse this notation 
by omitting the fc-th homology functor unless necessary. We work with singular 
homology here but our results are applicable in the simplicial setting as well. 
Local Homology. The local homology groups at a point x G X is defined as 
the relative homology groups H(X,X — x) ([22], page 126). In this paper, we 
assume that the topological space X is embedded in some Euclidean space M. d 
Let d x : R d — > K be the Euclidean distance function from a fixed x £ X, 
d x (y) := d(x,y) = \\y - x\\. Let B r = B r {x) = d^&r] and B r = B r (x) = 
d~ x [r, oo) be the sublevel sets and superlevel sets of d x . Taking a small enough 
r, the local homology groups in questions are in fact the direct limit of relative 
homology groups, lim^o H(X,X n B r ), or alternatively linv^o H(X n B r ,X <1 
9B r )[6], see Fig. [2} We then adapt two multi-scale notions of this concept based 

3 This assumption can be relaxed in several ways but this setting is most common in 
our applications. 
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Fig. 2: Local homology as the direct limit, lim^o H(X, X n B r ) (left) or 
lim r ^o H(XflB r ,Xn dB r ) (right). 

on persistence (which are first introduced in [5]), referred to as the r -filtration 
and the a-filtration. The goal of this paper is to derive sampling conditions 
that are appropriate to compute the persistence diagrams with respect to these 
filtrations, therefore approximating the local homology at x G X. 

For a fixed a > 0, let X Q be the "thickened" or "offset" version of X, that is, 
the space of points in R d at Euclidean distance at most a from X. Suppose L is 
a finite set of points sampled from X0 where L C X and L a = U xe LB a (x). In 
subsequent sections, we put further restrictions on L where we suppose L is an 
e-sample of X, that is, \/x € X, d(x, L) := inf v& l d(x, y) < e. 

The r -filtration (Fig. [3]) is a sequence of relative homology groups connected 
by linear maps induced by inclusion and excision, constructed by fixing a thick- 
ening parameter a and varying parameter r, for r' > r, 

► h (x Q , X Q n B r ' ) -> H(x Q ,x Q ns r ) 

The same filtration could be built on a set of points L sampled from X, that is, 

> H(L«,L a nB r ) H(L a ,L a flB r ') • •• . 

Here, we fix the space at resolution a, and vary the scale r at which we analyze 
the local neighborhood, analog to changing the lens from the front of the camera. 

The a-filtration (Fig. [4]) is a sequence of relative homology groups connected 
by inclusion, constructed by fixing r and varying a, for a < a' , 

► H(x Q n B r , X Q n dB r ) -> H(x Q , n s r ,x a - n as,.) -> • • • . 

Its discrete counterpart built on a set of points L sampled from X is, 

► H(L Q n B r , L a n dB r ) -> H(L a < n S r , L a ' n 9S r ) • • • . 

Here, we fix the size of the ball which defines the locality, i.e. the size r of the 
local neighborhood, and we vary the scale a at which we analyze the space. 
Cech and Vietoris-Rips Complexes. Suppose L is a finite point set in M. d 
and L a = \J xe i J B a {x). The nerve of L a is the Cech complex of L, denoted 
as C a — C a (L) (omitting L from the notation unless necessary). The Vietoris- 
Rips complex of L is denoted as lZ a , whose simplicies correspond to non-empty 
subsets of L of diameter less than a. For Euclidean metric space, we have, Va > 0, 
Ca/2 C lZ a C C a C 72-2a- This implies that the persistence modules {H(Cq,)} q 
and {\~\(TZ a )} a are a- interleaved. 

4 Our results would hold with minor modifications in the setting of sampling with 
noise, where elements of L lie on or near X. 
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Fig. 3: The r-filtration for space X and its offsets (left), and the same filtration 
built on a set of points L, sampled from X. 
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Fig. 4: The a-filtration for space X and its offset (left), and on the right, the 
same filtration built on a set of points L, sampled from X. 




3 Approximating Local Homology: en-Filtration 

In the a-filtration, since we will be computing relative persistent homology, there 
are certain requirements on the pairs, such that the maps of the relative filtration 
are well-defined. Two filtrations, A — {A a } ae Tsi and T = {-F a }aeR are called 
compatible if for all a < /3, the following diagram commutes: 



Ap > F p . 

This ensures that the relative persistence module is well-defined R In our con- 
text, all the maps are induced by inclusions hence the above diagram commutes. 
We highlight steps involved to obtain our approximation results: 

— First, we show that under certain conditions, the relative homology of a ball 
modulo its boundary is isomorphic to that of the entire space modulo the 
subspace outside the ball. 

— Second, we prove that if we have two compatible filtrations J- and A which 
are respectively interleaved with Q and B, the relative persistent homology 
H(J r , A) is approximated by H(Q,B). This result may be of independent 
interest. 

— Last, we prove a series of inter-leavings to show that both filtrations in our 
case can be interleaved with a Vietoris-Rips construction on the samples. 

We first show that the following two filtrations are equivalent (as a increases): 

0^ H(X a r\B r ,X a ndB r ) -K..-> H(B r ,dB r ), (1) 
-> H(X Q ,X Q -wHBr) H(M",R" -intS r ). (2) 

5 Note that this is equivalent to the condition given on pairs of filtrations under the 
two function setting [26| . 
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Fig. 5: Left: the a-filtration with respect to the pair (X a n B r , X Q D dB r ). Right: 
the filtration with respect to the pair (X a ,X Q — inti? r ). 

Note that X a — int B r = X Q — (X^nint B r ). Unless otherwise specified, a £ [0, oo). 
Graphically, these filtrations are shown in Fig. [5] As it turns out, it is easier 
to argue about the filtration in Fig. fright) than Fig. [5](left), as shown in the 
following lemma. Recall a pair of space (A, B) forms a good pair if B is a nonempty 
closed subspace that is a deformation retract of some neighborhood in A f|23|. 
page 114). 

Lemma 1. Assuming that spaces X Q Hi? r andJL a ndB r form a good pair, then 
H(X„ n B r ,X a C\dB r ) = H(X«,X« - intB r ). 

Proof Sketch. This follows from the Excision Theorem ([27], Theorem 15.1, page 
82) and the Excision Extension Theorem ([27], Theorem 15.2, page 82). We 
excise the space X Q — B r from the pair (X a , X a — int B r ), and obtain H(X Q , X Q — 
mtB r ) ^ H(X Q - (X a -B r ),X a -intB r - (X a -B r )) = H(X Q n B r , X Q n dB r ). 
Since the closure of X Q — B r needs not be contained in the interior of X Q — int B r , 
there are some technical conditions which require some care. See Appendix [X] 
for details. □ 

We now show that we can approximate local homology at multi-scale via the 
a-filtration using sample points. We begin with sequence ([2]). Specifically, we 
first consider the filtration corresponding to the whole space {X a }, and then the 
filtration corresponding to the subspace we quotient by, {X a — int B r }. The key 
is a technical result described in Theorem [l] which says that if we can approxi- 
mate filtrations independently, we can approximate their corresponding quotient 
filtration. We consider this result to be of independent interest. 

Theorem 1. If we have two compatible filtrations interleaved with two other 
compatible filtrations, the relative filtration is also interleaved. Formally, if com- 
patible persistence modules J- = {F a } ae ^ and Q = {G a } Qe R are Si -interleaved, 
A = {A a } ag R and B = {-B Q } Q gR are e^-interleaved, then the relative modules 
{(F a , A a )} a< zR and {(G a , B a )} ae ]si are e -interleaved, where e = max{£i, £2}. 

Proof Sketch. Without loss of generality, assume E\ = £2 = £■ Each pair, {(F, A)} 
and {(G, B)}, gives rise to a long exact sequence which are related by the inter- 
leaving maps. This gives the following commutative diagram: 
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H n (A a ) H n (F a ) j " > H n (F a ,A a ) fc ° > H n _i(/1 Q ) -^-A H n _!(F a ) 



H n (-Bo,+e) > H n (G a +e) > V\n(G a +s, B a ^ 



1-1 /"-I 

H n -i(Sa+c) — > H n _i(G a + e 



Wn(A a +2e) — > H n (_F a +2e) — > H n (-F a +2e , A a +2e ) — > H n _l(yl a +2e) ► H n _i (7 ? a +2 e ) 

To prove that the inter-leavings between individual modules imply an inter- 
leaving between {(F, A)} and {(G,B)}, we would need some careful diagram 
chasing at the chain level. That is, we need to prove each of the four diagrams 
(reviewed in Fig. [IJ needed for interleaving commutes, i.e. diagrams in Fig. [6] 
commute. The key issue is that although each row is exact, maps between per- 
sistence modules do not split — therefore we may have one persistent relative 
class without a persistent class in either component nitrations. The full details 
of the proof (with digram chasing arguments) are given in Appendix [A] ED 



H n (F a , A a ) ► H n (F a + 2e, A a + 2e) H„ (F a +e, A a + C ) 




H n (G Q1 + e, Ba + [) H„(G a , Bo) > H n (G Q + 2 e , -B Q + 2e ) 

Fig. 6: Commuting diagrams for e-interleaved persistence modules. 

Before we state our main theorem below, we define the set of sample points 
which lie outside the ball, L = {p £ L\p £ B r }, and L a — U xe iB a (x). We have: 



Theorem 2. The persistence module with respect to the Vietoris-Rips filtration 
of {(L a , L a )}, that is, {(Tt a (L),1Z a (L))} is ^2e + a + ~j -interleaved with the 
a-ftltration, {(X a ,X a — intS r )}, for a < r. 

Proof Sketch. Since we would like to approximate the persistence diagram of the 
pair {(X Q ,X Q — int_B r )}, we could approximate each filtration independently. 
We describe the key ingredients in our proof here and defer the technical details 
involving each step to Appendix [A"| 

First, we consider the whole space filtration {X Q }, and show that {X Q } is 
e-interleaved with {L a } (Fig.[7fa)), which is relatively straightforward assuming 
L is an e-sample of X. Since the nerve of L a is the Cech complex C a (L), then 
{X a } is e-interleaved with {C a (L)}. 

Second, approximating the subspace filtration {X Q — int B r } is more involved. 
The straightforward approach is to simply remove intS r from {L a } and con- 
sider {L a — int_B r } (Fig. (7jhV). This is computational expensive, so instead, we 
consider {L a — int_B r } (Fig. mc), note its subtle difference with Fig. mb)). We 
then show that {X Q — int_B r } is 2e-interleaved with {L a — intB r }, by showing 
that L is a 2e-sample of X — int B r , that is, removing sampled points in the ball 
gives a good sample of X — int B r . 
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(a) (b) (c) (d) 

Fig. 7: (a) L a , (b) L a — int B r , (c) {L Q }-int5 r , (d) {L a }. 

Third, wc further prove that {L a — inti?,,} is ( ^^-interleaved with {L a }. 
This is illustrated in Fig. [7^d), where we allow the offset to intersect inside 
the ball, and show that the error remains controlled. In addition, the nerve 
of L a is the Cech complex C a (L). A combination of these results implies that 

{Xq, — xatB r } is ^2e + -interleaved with {C a (L)}. Finally, having shown that 
we can approximate both the filtration on the whole space and on the subspace 
(which we quotient by) , we invoke Theorem]!] Finally, based on that the Vietoris- 
Rips and Cech complexes are a-interleaved, that is, lZ a C C a C lZ2a, we obtain 
the additional a factor in the approximation result. However, the limiting factor 
in this case is the subspace filtration. The results are only meaningful over a 
range of values for a. For example, if a > r, the local homology is isomorphic 
to a (d+ l)-sphere. This is discussed further in Appendix |A} ED 

Computation. Using Theorem [2] we can compute relative persistent homology 
of the nitrations built on the sample points using the algorithm described in [28] . 

4 Approximating Local Homology: r-Filtration 

In this section, we describe approximating local homology with respect to a fixed 
point x at multi-scale via r-filtration (Fig. [3]). We fix a thickening parameter a 
and drop it from the notation, using only X. Consider the following filtration, 
for r > s > t, 

...->■ H(x,xns r ) H(x,xnB s ) H(x,xns*) (3) 

Now we endow the space X with a function g : X — > R, which is the Euclidean 
distance to a fixed point x, g(x) — d{x,y) — d x {y). g could be viewed as the 
restriction onto the space X, of a Euclidean distance function to a point x, 
d x : E d — > R, that is, g = d x \ x . The function g is 1-Lipschitz and we see that 
X n B r = g~ l [r, oo), the superlevel set of g. The above sequence becomes, 

H(X,.g- 1 [r,c5o)) -> H(X, g- 1 [s, oo)) -> H(X, g- 1 ^, oo)) . . . . (4) 

This is the relative persistence module of g. Now let / = — g : X — > R, / is also 
1-Lipschitz. Sequence §5§ holds the same information as sequence Q assuming 
tame functions according to Extended Persistence Symmetry Corollary [29] 

6 It is unclear whether this holds in the case of non-tame functions which could arise 
as a consequence of a pathological underlying space. 



8 



(that is, the ordinary persistence diagram of a function / equals the relative 
persistence diagram of — / up to a dimension shift and central reflection), 

. . . HCT^-oo, a}) -> Htf-^-oo, b}) -> H^-^-oo, c]) . . . , (5) 

where a < b < c, which corresponds to the persistence module of / based on 
its sublevel sets. Since the nitrations in sequence ^ and sequence Q hold the 
same information, we can translate the diagram and recover the information for 
the original r-filtration (sequence 

The key insight is that in this case, the r-filtration amounts to studying 
the persistent homology of a function on the space — the distance function to 
a point, which is a particularly nice function, i.e. 1-Lipschitz. In this section, 
we give results under strong assumptions on the space X with some further 
discussions deferred to Appendix [Bj 

We introduce a strong assumption on homotopy between a pair of spaces, 
which requires that points are only moved a bounded amount in the homotopy. 
Two subsets of Euclidean space, X and Y are e-homotopy equivalent, if there 
exists two functions i : X — > Y and h : Y —> X such that hoi is homotopic to the 
identity map id%, ioh is homotopic to zdy, d(p, hoi{p)) < e and d{p, ioh(p)) < e. 
The consequence of such an assumption is discussed further in Appendix |5J In 
our context, the map i is typically the canonical inclusion map, therefore if 
d(p, i o h(p)) < e then d(j>, h o i(p)) < e. Then we refer to h : Y — > X as the 
e-homotopy equivalence between Y and X, where d(j>, h(p)) < e. 

The first step in approximating the r-filtration is relating the sublevel set 
filtration of a 1-Lipschitz function / : X —> M. on the space X, and the sublevel 
set filtration of a corresponding function f e : X e —> R on X e . These nitrations 
together with maps induced by space inclusions form the (homology) persistence 
module of / and f £ , respectively. Recall that / is the negative of d x restricted 
to X, / = — d x \ x . Likewise, f e — — d x | x . Since there is an inclusion X <-> X e , it 
follows that / = / e | x . For the rest of the section we use the following notation 
for sublevel sets: F(a) = oo,a], F e (a) — /^ 1 (—oo,a], for every a 6 E. The 

persistence module of / and f e are represented as {H(F(a))} a and {H(F e (a))} a 
respectively. 

Lemma 2. Suppose X and X e are e-homotopy equivalent through the canonical 
inclusion map i : X — > X e and the map h : X £ — > X. Then the persistence modules 
off and f £ , that is, {H(F(a))} a and {H(F e (a))} a , are e -interleaved. 

Proof. Consider the following sequence of maps: 

F(a) A F e {a + e) ^ F{a + 2e). 

We define the map i' — i\p/ a \ and show that i' is well-defined. Vp € F(a), 
by definition, we have f(p) < a and / = ,f £ | x , therefore f e (p) < a. This implies 
that p £ F e (a) C F e (a+e), therefore, i' is a well-defined inclusion, which induces 
inclusion on the homology level, u : H(F(a)) — > H(F e (a + e)). 
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We define h! = h\ F ( Q+£ ), and we need to show that h! is well-defined, that is, 
the image of h! lies in F(a + 2e). \/p € F e (a + e), by definition, we have f e (p) < 
a + e. Since f e = — d x \ x ^, then —d(x,p) < a + e. Combining with d{p, h{p)) < e, 
we have f(h(p)) := —d(x,h(p)) < d(p,h(p)) — d(x,p) < a + 2s. This implies 
that h(p) g F(a + 2e). Therefore h' is well-defined. In addition, based on our 
assumption that X and X e are homotopy equivalent through maps i and h, this 
implies that h! is a homotopy equivalence, which induces an isomorphism on 
the homology level, K : H(F e (a + e)) -+ H(F(a + 2s)). 

In order to show persistence modules {H(_F(a))} a and {H(F e (a))} a are e- 
interleaved, we could easily verify that the four diagrams in Fig. [T] commutes 
based on linear maps and ft.*. 

The next step is to relate the above filtrations to the union of balls on 
the samples. For notational convenience we define the union of balls centered 
around points with a function value less than some threshold a £ R as U £ (a) = 
U P £L,f(p)<a,B e (j)), where afK and a < 0. Since U e (a) contains Euclidean balls 
which are convex, the Never Lemma holds, that is, its nerve Af(U e (a)) 1 which 
corresponds to the Cech complex C £ (a), and U e (a) are homotopy equivalent. As 
a varies, these complexes together with the maps induced by inclusions form a 
persistence module {H(C e (a))} a . Similarly we define the corresponding Vietoris- 
Rips complex and its persistence module as 1Z £ ( a ) and {H(lZ e (a))} a respectively. 

Lemma 3. Suppose X and X e are s-homotopy equivalent through the canonical 
inclusion map i : X — > X e and the map h : X £ — > X. Suppose L is an e-sample 
of X. Then the persistence modules {H(F(a))} a of f and {H(C £ (a))} a are 2e- 
interleaved. 

Proof. The proof is nearly identical to the proof of Lemma [2j Consider the 
following sequence: 

F(a) ^U e (a + e)^ F{a + 3e) 

We define the map i' = i\ F ( a j and show i' is well-defined. Vp € F(a), by defini- 
tion, f(p) — —d(x,p) < a. Since L is an e-sample of X, there exists q € L such 
that p € B £ (q), that is, d(p, q) < e. Combining the above inequalities, we obtain 
f(q) — —d(x, q) < —d(x,p) + d(p, q) < a + e, implying that p G U e (a + e). 

For map h! , since U £ (a + e) C F e {a + 2e), based on the results in Lemma[2] 
that the map F t {a+2e) — > F(a+3s) is well-defined, we can define hi = h\xj t t a +e)- 
Following similar argument in Lemma [2j {V\{U £ {a)Y\ a is 2e-interleaved with 
{H(F(a))} a . By the Nerve Lemma, the union of ballsFJ is homotopic to the Cech 
complex for all a, leading to H(C E (a)) = H(C/ e (a)). □ 



7 Note the definition of the union of balls filtration - it precisely equals to the lower 
star filtration of the Cech complex. 
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Theorem 3. Suppose h is an 2e-homotopy equivalence between X2 £ and X, and 
L is an e-sample o/X. Then the Vietoris-Rips filtration {H(7?-2£(a))}a * s a 4e- 
approximation of the r-filtration {H(F(a))} a . 



Proof. Suppose X2 £ and X are 2e-homotopy equivalent through the canonical 
inclusion map i : X — > X2 £ and the map h : X2 e — > X. We can construct the 
following commutative diagram on the space level: 





a) 




+ <0 




+ <0 ^ 

U 2e (c 




— F(a 


+ 5e) 




+ 6e) 




+ 6e) 





U e (a + 6e) TZ 2e (a + 6e) U 2e {a + 6e) 

First we consider the top and bottoms rows in the diagram. The 1st map is an 
inclusion on the space level. The 2nd and 5th maps are homotopy equivalences 
based on the Nerve Lemma (which induces isomorphisms on the homology level). 
The 3rd and 4th maps are inclusions based on interleaving between Cech and 
Vietoris-Rips complexes, i.e. C e C TZ 2 e Q C 2 e- Second, all the vertical maps 
between the top and bottom rows are inclusions. Finally, we define the connecting 
map U 2e {a + e) —> F(a + 5e) as h' = h\ U2 ^ a+£ y To show h' is well-defined, 
Vp € U2e{ot + e), f(p) < a + 3e, since hf is a 2e-homotopy, h'(p) has a function 
value at most a + 5e, therefore h'(p) € F(a + be). 

From the above commutative diagram, we consider the following maps be- 
tween spaces: F(a) —> lZ2 E {a + e) —> F(a + 5e). This leads to a factor of 4e in 
the interleaving between persistence modules {H(7?.2e(a))}a and {H(F(a))} a . ED 

Computation. Here we have reduced the computation of persistent local ho- 
mology to standard persistence on the sample points [314] . 

5 Discussion 

Local homology and relative homology are common tools in algebraic topology. 
In this paper, we recounted two different multi-scale notions of local homology: 
the a- and r-filtrations. We show that both can be well-approximated using 
Vietoris-Rips complexes based on a finite sample of the space and therefore 
efficiently computed. We also prove a novel technical result involving interleaving 
between relative persistence modules derived from interleaving between absolute 
persistence modules. Several open questions remain: Are there better geometric 
measures to describe the sampling conditions in approximating local homology? 
Could a similar sampling theory be developed for witness complexes? Under 
what conditions on the space are the underlying nitrations we study tame? 

Our work was motivated by stratification learning. However the results in 
this paper could be applied to any applications where local or relative homology 
computations are relevant, i.e. for future directions, the approximation of Conley 
index or well groups [30] . 
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A Detailed Proofs for Approximating Local Homology at 
Multi-scale via a-filtration 

A.l Lemma Q] 

Lemma [TJ Assuming that spaces X a and X Q — int B r form a good pair, then 
H(X a n B r ,X a C\dB r ) = H(X«,X Q - intS r ). 

Proof. First we recall several theorems related to excisions. Let Y, U, A be topo- 
logical spaces. The inclusion map of pairs (Y — U, A — U) — > (Y, A) is called an 
excision if it induces a homology isomorphism. In this case, one says that U can 
be excised. We will make use of the following two results about excision (|27j). 

Theorem 4 (Excision Theorem). (fFtf , Theorem 15.1, page 82) If the clo- 
sure of J] is contained in the interior of A, that is, clU C int A, then U can be 
excised. 

Theorem 5 (Excision Extension). (1271, Theorem 15.2, page 82) Suppose 
VcUc A and (i) V can be excised; (ii) (Y — U, A — U) is a deformation retract 
of (Y - V, A - V). Then U can be excised. 

In our context, let Y = X Q , A = X Q — int£> r , U = X Q — B r . Therefore 
Y — U = X Q n B r and A — U = X Q n dB r . However, since clU needs not be 
contained in int A, so we must define a suitable V C U. One direct way is to 
choose some small enough positive 8 and a neighborhood I, such that we define, 
I = X Q n dB r n clU, l s = {x e clU | di(x) < 8}, and V = U - I s , where d t (x) is 
the Euclidean distance from x to the set I. 

The existence of this 8 follows from the assumption that the pair (X Q , X Q — 
int-B, ) := (Y,A) form a good pair. This is a technical condition which implies 
the existence of a neighborhood of Y — U (i.e. Y — V) that deformation retracts 
to Y — U. It is then straightforward to verify that V C U C A satisfies the 
hypotheses of Theorem [5] 

Therefore the chain map k : C(Y, A) — > C(Y — U, A — U) is an excision. It 
is defined as k — r# ° s, where r# is the chain map induced by the retraction 
r : (Y — V, A — V) — » (Y — U, A — U), and s is the chain- homotopy inverse 
of the chain map included by the inclusion of pairs (Y — V, A — V) — > (Y, A), 
«:C(Y,A)->C(Y-V,A-V). ED 



A.2 Theorem [TJ 

We describe our long and technical proof of Theorem[l]based on diagram chasing. 
Wc first need the following lemma that comes from the short exact sequences of 
a pair ([22], page 140). 

Lemma 4. The quotient map on the chain level commutes. That is, for com- 
patible maps A — > B and X — > Y there is a map (X. A) — > (Y, B) such that the 
diagram in Fig. [S] is commutative. 
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C n (A) 
f 

C n {B) 



C n (X) 
9 



C n (X,A) 
h 

C n (Y,B) ■ 



-> 



-> 



Fig. 8: Commuting diagrams on the chain level. 



Proof. The assumption of compatibility ensures the left square commutes. Note 
that i,j must be injective maps and in all the case we consider / and g are also 
injective, which is sufficient for compatibility. To define h we note that im h = 
img/(im (goi)(Qimj). To show that the right square commutes (hoq = rog), we 
note that any class in im (r o g) must be in im q by exactness and the assumption 
that the left square commutes (goi = jo /). Since it is not in imi or map to imj, 
it is in xmh. Alternatively, any class in im (h o q) must have a lift to C(Y) since 
r is a surjection. This must be in im g by the definition of h, which concludes 
the proof. ED 

Theorem [1} If we have two compatible nitrations interleaved with two other 
compatible nitrations, the relative filtration is also interleaved. Formally, if com- 
patible persistence modules T = {.Fa}aeR and Q — {G a } ae K are ei-interleaved, 
A = {A a } ae ]SL and B = {-B a } Q gR are £2-hiterleaved, then the relative modules 
{(F Q , J 4 Q ,)} cte R and {(G Q , £? q )} qS r are e-interleaved, where e — max{£i,£2}. 

Proof. We begin with a list of notations. Suppose {F} and {G} are compati- 
ble and are e-interleaved with homomorphisms {f a : H(F a ) — > H(G a + £ )} and 
{g a : H(G Q ) — > H(F Q+e )}. Suppose {A} and {B} are also compatible and s- 
interleaved, with homomorphisms {tp a : H(A a ) — > H(i? Q+e )} and {ip a : H(B a ) — > 
H(A a+e )}. For relative homology to be well-defined, we have injective maps at 
chain level, for simplicity, we further require A a «-> F a and B a ^ G a . 

We would like to prove that {(F,A)} and {(G,B)} are also interleaved, and 
we could construct their corresponding homomorphisms, {/i Q : H(F a , A a ) — > 
H(F a+£ , A a+£ )} and {v a : H(G a ,B a ) -> H{G a+e , B a+£ )}. 

To prove the result, we pass to the stack of long exact sequences in Fig. 
[9] First, we explain the notation. A map, i.e. 0", represents a map that maps 
n-dimensional homology groups of A a to some other homology groups. We note 
that all the squares in this diagram (Fig. [9| commute based on Lemma |4j and 
by assumption the two component nitrations are interleaved, so the first, sec- 
ond, fourth and fifth columns commute with the maps induced by inclusion. 
For example, the map induced by inclusion im(H„(F Q ) — » M n (Fa+2e)) equals 
im (g" +e ° /")■ Commutativity implies interleaving in some of the cases. We 
prove the following triangle commutes (Fig. [6| through four claims. 

Claim 1: if a relative class is in im (H n (F a , A a ) — > H n (F a+ 2e, A a+ 2e)), and 
it is in imj" and imj^ +2e , then it is in im<7" +£ . 
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H n (A a 



-4 U n {F a ) J - ► H n (F a ,A a ) 

fn 



-> H n _i(_F a ) 

fn — l 



Pit In r n Pn-1 

H n (-Ba+e) — " ► H n (G Q + e ) ► H n (G a +e, -Ba+e) — - — ► H n _i(_B Q + e ) > H n _i(GcH 



H„(^4 a +2 £ ) — > H„(_F a+ 2 £ ) — > Y\ n (F a+ 2 E , A a+ 2 e ) — > H„_i(A a+ 2 e ) > H„_i(_F a+ 2 £ ) 



Fig. 9: Commuting diagrams for the long exact sequence involving two pairs of 
nitrations. 



If a relative class 7 in im (H n (F a , A a ) — > H n (F a+ 2 e , A a+ 2 e )) is in imj" and 
im j" +2e , then by the interleaving, it must be in H„(G a+e ). Therefore suppose 7 
is not in imq" +e , it must have a preimage in H n (B a+e ). Since 7 is in imj^ +2e , 
it does not have a preimage in H„(^4 Q+2e ). This would imply that the lower left 
square does not commute {g" +e °p" +e 7^ in +2e V'n +e )- That is a contradiction, 
therefore it must be in img" +e . 

Claim 2: If a relative class is in im (H„(F Q , A a ) — > H n (F a+ 2 e , A a+ 2 e )), and 
it is in cokj" and cokj™ +2£ , it must be in cokq" +e . If the relative class 7 in 
im(H n (F a ,A a ) H„(F a+2e ,F Q+2£ )) is in cokj^ and cokj^ +2e , then by exact- 
ness 7 maps into imfc" and im/c" +2e , that is, it maps to a non-trivial element 
in H n -i(A a ) and H„_i(j4 Q _|_ 2e ). By the interleaving between A and B, it must 
also map to an element of Y\ n -i(B a+t ). Furthermore, it must be in keri™^. 
Therefore suppose 7 is not in cdkq" +e (or equivalently, imr" +e or kerp"j^) , 
it must map to a class in H„_i(G , Q , +e ), which implies that the top right square 
does not commute {f"_ 1 o ^ p^-t <t>%-i) leading to a contradiction. 

We now show that commutativity is not a sufficient argument. Consider a 
persistent relative class in H n (F Q , A a ) —> H„(F Q+2c , A a+2e ) such that it is in 
imj" and cokj" +2e . Alternatively, it may be in cokj" and imj" +2e . In these 
cases, we may map this class to zero the middle row and still maintain the com- 
mutativity of the diagram (although this implies the relative filtrations are not 
interleaved). This problem stems from the fact that the maps between persistent 
modules do not split (The relative persistence module does not split into direct 
sum of the image and cokernel in the long exact sequence) . 

Claim 3: If the relative class is in im(H n (F a ,A a ) — ► H„(F Q , +2c , A a+2t )), 
then it is not possible that it is in im j" and cok j" +2£ at the same time. 

First we handle the case where the relative class is in im and cokj" +2e by 
showing this cannot occur. Since it is in im j" at the chain level, there is a cycle 
representative in Z n (F a ). Since this maps to a cycle representative in Z„(F Q+2e ), 
this implies that the cycle is in the boundary. However, looking at the relavent 
part of the short exact sequence shown in Fig. [lO] 

The cycle representative in C n (F a+ 2 e ) lifts to some element in C n +i(F a+2e ). 
Now by assumption, there is still some cycle representative in C n (F a+2e , A a+2e ). 



1G 



ri + 1 a ) 

d 

C n (F a + 2a) 



C n +l(F a +2e, A a +2e 
d 

a + 2e ) 



Fig. 10: Short exact sequence on chain level. 



By commutativity, the bounding element in C n +i(.F Q +2e) must also map to a 
bounding element of the cycle representative in C n (F a+ 2 e , A a+ 2 e ), meaning it 
cannot be a relative homology class. If on the other hand, the cycle representative 
in C n (F a+ 2e) is in the kernel of the quotient map, a relative homology class would 
appear one dimension up. This is the case we deal with next. 

Claim 4: If the relative class is in im (H„(F Q , A a ) — > H„(F Q+2e , F a+2t )), and 
it is in cokj" and imj" +2e , then it must be in imq" +e or cokg" +e (i.e. it must 
be H„(G a+e , B a+£ )). 

For a relative class in cokj", there is a cycle representative in C n _x(j4 a ) of 
the corresponding class H„_i(j4 a ) which by the injectivity of the interleaving, 
maps to a cycle in C„_i(i? Q+e ) and C„_i(^4 ct+ 2 £ ). Further, since it is in cokj", 
it follows that it maps to a bounded cycle in C n -i(F a ) (and by injectivity) the 
corresponding cycle representatives in C ?l _i(G Q + e ) and C n -i(F a +2e) are also 
bounded. Since this relative class is assumed to be in imj^ lpha+2e , it follows that 
the cycle representative in C„_i(A Q , +2£ ) is now bounded, with the pre-boundary 
mapping to a cycle in C n (F a+ 2e) ■ This follows from a chain level understanding 
of the exactness of the bottom row. Take the representative (n — l)-cycle in 
C n -i(A a ) denoted by a and map it into C„_i(F a ). i"_ 1 (a) has a pre-boundary in 
Cn(-Fa) which maps to the cycle representative of the relative class in C n (F a , A a ). 
This is just the connecting homomorphism construction. If we map this relative 
cycle representative to C„(F a+2e , A a+2e ), since the class is in j" +2e , it lifts to 
a non-trivial cycle in Q n (F a+ 2e)- This cycle is precisely the image of the pre- 
boundary of a in C n ^i{A a+ 2e) mapped to C n {F a +2e) phis the pre-boundary of 
the image of «"_i(a) in C„_i(F Q+2e ). 

There are two case to consider. If 0"_ 1 (a) is a non-trivial cycle, then there is 
a homology class in kerp°^ a+e and by exactness, a corresponding class in the 
cok g «+ e . 

If 4>n-i ( a ) maps to a bounded cycle, then by the same reasoning as above, 
the pre-boundary of this cycle in C { n B% +e must map to a non-trivial cycle in 
CnG" +e . Hence there is a corresponding class in imq" +e . Proving the claim. 

Following the above four claims, we've shown the triangle in Fig.[6]commutes. 
Fig. [6] equals the trapezoid in Fig. [TT|(a) by setting a' = a. It follows that the 
trapezoid in Fig. |ll[ a) commutes based on similar diagram chasing argument. 
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The other diagrams in Fig. [TT] follow similar proofs. For example, to show 
that the diagram in Fig. |ll[ d) commutes, the argument goes through in precisely 
the same way, on diagrams shown in Fig. [12] and Fig. |13| 



H(F a , A a ) 



H(-/ ? a '+2e, A a i + 2e ) 



H(G a + e , B a -f 



(a) 



H(-F a + e , A a + t ) > H(F a + e ,A a 



H(G a , B a ) 



H(G Q ' + 2 e , B a i + 2e) 



(P) 



H(-F a + e , A a i +e ) 



H(G Q , B a 



H(G a i, B a i) 



(c) 



H(F a ,A a ) 



H(F a r, A a i 



H(G a + e , B Q 



H(G a '+ < ;, -B 



Fig. 11: Commuting diagrams for e-leaving of the pairs. 

This shows that the two commute and hence we conclude that the relative 
filtrations are interleaved. ED 



A.3 Theorem [2] 

First we prove a collection of lemmas ([5] [6j [7j [8j [9] [To| that are relevant in 
proving Theorem [5] 
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H(A a 



H(F a ) 



H(F a ,A a 



H{A a ) 



H (F a 



H(A 



H(F Q 



H(F Q ,,A Q 



H(A D 



H(F a 



H(-B a '+ E 



H(G a / +E ) 



H(-Bq'+ E ) 



H(G Q 



Fig. 12: Commuting diagrams for Fig. [ll] (d) top path. 



H(A a ) 



H(-B a '+ e ) 



H(F„) 



H(G Q + e ) 



H(G a '+ e ) 



H(F a ,A Q ) 



H(G a + e, -BqH 



H(A a ) 



H(B Q 



H(-B Q '+e) 



H(F Q ) 



H(G a +£) 



H(G 



Fig. 13: Commuting diagrams for Fig. [IT] (d) bottom path. 



Lemma 5. If L is an e-sample o/X then {X Q } is e-interleaved with {L a }. 

Proof. Given that L is an e-sample of X, by definition, L C X, this implies 
that (a) L a C X Q and (b) £ Q + e C X Q+e . Subsequently, we would prove by the 
triangle inequality that, (c) X a C L a+£ . Combining (a), (b) and (c), we have, 



L„ C 



C L a+e C X a+e . 



By the special case of e-interleaving, we have L a C X Q+e and X Q C L a+e , 
therefore the persistent homology modules of {X Q } and {L a } is e-interleaved. 

Now we prove that the inclusion in (c) holds. For any point p € X a , let 
q = a,Tgmiii xe .xd(p,x), therefore by definition of X a , d(p,q) < a. Since q € X 
and L is an e-sample, let s — arg min zg £ d(p, z), by definition of L, d(q,s) < e. 
By triangle inequality, s) < g) + s) < a + e. Therefore p G L a+e . ED 



Lemma 6. The nerve of L a , Af(L a ), is homotopic to L a . 

Proof. This is an application of the Nerve Theorem. Since these are Euclidean 
balls in Euclidean space, they are all convex as are all their intersections. They 
are hence contractible and the Nerve Theorem applies. ED 
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Lemma 7. If L is an e-sample o/X, then L is ale sample o/X — int_B r . 

Proof. Consider a point outside in X but not in inti? r . If it is covered by a 
sample lying outside of int-B r , then it is still with e of a sample point. If it is 
covered by a point within int B r , then the closest sample point outside of int B r 
can be no further than 2e. This follows since all points e away from the int B r 
cannot be covered by a sample point which lies within int B r , and therefore any 
point outside int B r but covered by a sample point within int B ri lies at most 2e 
from a sample point with lies outside int B r and so is in L. 

Formally, consider a point p G X — int B r . Let s = argmin ze £ (p, z), that is, 

p is covered by s. If s is outside of int-B r , that is, s € L, then d(p,s) < e. If 
s € int£> r , let t = arg min 2g £ d(p, z), we claim that d(p,t) < 2e. Therefore L is 
a 2e sample of X — int B r . Now we prove the claim that d(p, t) < 2e. We could 
prove by contradiction. Suppose d(p, t) > 2e and p is just on the boundary of 
X fl B,.. Then there exists at least a point z that is e away from p that is not 
covered by any sample point in L. This contradicts with L being an e-sample. 

□ 

Lemma 8. {L a — int_B r } is It-interleaved with {X Q — int £?,,}. 

Proof. The proof follows from the Lemma [7] and precisely the same argument as 
in Lemma [5] ED 



Lemma 9. For a < r, the nerve of L a — int B r is homotopic to the union of 
balls L a with int B r removed. 

Proof. Since we are removing the ball the intersections are no longer convex. 
However the condition a < r ensures that they are still contractible. This is only 
an outline of the proof. The goal is to prove that from any intersection there is 
a homotopy to a convex body and hence all the intersections are contractible. 
Take an arbitrary intersection. If it does not intersect intB r , it is convex. If it 
does, then take the tangent plane to B r at a point on the boundary within the 
intersection. Clear the half-plane which does not contain the B r intersected with 
the intersection is convex and hence contractible. 

The rest of the intersection can by retracted to the tangent plane, which 
we prove by giving an explicit deformation retract. The tangent plane will be 
referred to as T(s) (the tangent plane at point s). 

First, we define a deformation retract before we remove B r (x). We consider a 
straight-line homotopy to the T{s) by projection. We project each point p to T{s) 
within the intersection. We call this point q. By convexity of the intersection, this 
path is a geodesic which lies completely in the intersection. It is also continuous. 
With B r (x) removed this will remain a valid deformation retract if all geodesies 
remain in the space (i.e. pass through B r (x)). The points p, q, s as above shown 
in Fig. |§a). 
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Fig. 14: (a) The layout of the points p, q, s along with the deformation retract, (b) 
The situation when a is positive, (c) The situation when a is negative (cannot 
occur), (d) A bound on the distance between p and s. Note that although this 
is in high dimensions, these figures are general since we can restrict ourselves to 
the plane defined by p, q, s. 



We prove that the geodesic does not leave the space by contradiction. With- 
out loss of generality assume the point p is on the boundary. To leave the space, 
it must cross the boundary of B r (x), and the shortest path from that point must 
also go through the ball. In particular, we see that to pass through B r (x), the 
geodesic must form a negative angle with the tangent plane, shown by a (Com- 
pare Fig.^b) and (c)). Since the line (p,q) is a shortest path to T(s), it must 
be perpendicular to T(s). This implies that the angle between T{p) and T(s), 
denoted by j3, must be acute. 

This, however implies that the poi nt o f contact of the two hyperplanes is at 



least \^2r far apart as shown in Fig. 14 d) . Since we can choose the point of 



contact such that no point on the ball in the intersection is more that a from 
the point of contact and a < r (this is obvious if we take 2a < r), this implies 
that such a point cannot be in the intersection. 

Note that the original projection was to T(s) within the intersection. This 
means that (p, q) may not be perpendicular to T(s). However in this case, (p, q) 
will not go through the B r (x). Since (p, q) must form a chord of a ball of radius 
a, passing through B r (x) would generate a chord in B r (x). This implies that 
either the center of the ball of radius a lies within B r {x) or that a > r. 
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Hence, the projection to T(s) is a deformation retract and the non-convex 
part is contractible as well. ED 



Lemma 10. {L a — inti? r } is (^^r J -interleaved with {L a }. 

Proof. This proof works at the nerve level. We show that if an intersection be- 
tween balls exists in L a it will exist in L a+a 2 i r — int B r . Clearly any intersection 
L a — int B r is also in L a . If an intersection is in L a but not in L a — int B ri this 
implies it lies in int-B r . Assuming that there is an intersection contained within 
int B r . Note that the furthest this intersection can be from the edge of int B r is 
bounded by a 2 /r. The derivation can be found blow. Hence the two nitrations 
are (a 2 /r)-interleaved. 

Now we focus on the derivation of (a 2 /r)-bound. 




Fig. 15: The geometric situation illustrating how deep in the interior of B r two 
offsets from the exterior of the ball can intersect in terms of the radius of the 
ball r and the offset filtration parameter a. 



To prove that the we still get a good approximation we need to show that if 
offsets intersect in the ball, they will intersect soon after outside the ball. The 
situation is illustrated in Fig. [l5j Normalizing by r, it is a basic geometric fact 
that 




The distance we must bound, by the Pythagorean theorem is 
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Since yfl-g > 1 - f£ for < £ < 1 
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Multiplying by r, we see that for a < r, we see that any simplex (intersection 
of balls) in {i Q }ae[o,oo) will be in {L Q / — int -B r } Q 'e[o,oo) for a + a 2 /r < a' < r. 
We obtained our desired bound. 

□ 

Finally we prove our main theorem for a-filtration. 
Theorem [2[ The persistence module with respect to the Vietoris-Rips filtration 
of {(L a , L a )}, that is, {(lZ a (L),lZ a (L))} is (2e + a + ^-^j -interleaved with the 
a-filtration, {(X Q ,X Q — int-B r )}, for a < r. 

Proof. Lemma [5] tells us {X Q } and {L a } are e- interleaved. Lemma [6] shows 

Af(L a ) ~ L a . Lemma[8]states {X Q — int B r } and {L a — mtB r } are 2e-interleaved. 

Lemma 9 shows M{L a — int_B r ) ~ L a —mtB r . Lemma 10 indicates {L a — mtB r } 

~ — 2 — 
and {L a } are -interleaved. 

Lemma 8 and 
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I implies that {X a — int B r } and {L a } are (2e+ ^^-interleaved. 

Combined with Lemmapl we have the relative modules, {(X Q , X Q — int B r )} and 
~ 2 — 

{(i Q , L a )} are (2e+ ^r)-interleaved. This means, the persistence diagram of the 
Cech filtration of {{L a , L a )} is (2e + — ) approximation of the persistence dia- 
gram of a-filtration. 

Now we consider Cech filtrations for both {L a } and {L a }, that is, {C a (L)} 
and {C a (L)}. Since both are a- interleaved with their Vietoris-Rips counterparts, 
that is, {C a (L)} is a-interleaved with {lZ a (L)}, and {C a (L)} is a-interleaved 
with {lZ a (L)}. We lose a factor a in the approximation by switching to Vietoris- 
Rips filtration of {(L a , L a )}. ED 



B Discussion on the r-filtration 

Here we give a short discussion on the assumptions made in the Section [4] and 
relate it to existing work. Our primary assumption is an e-homotopy equivalence 
between a pair of spaces. This is a strong assumption since it requires that 
points are only moved a bounded amount in the homotopy, essentially excluding 
situations illustrated in Fig. [16] Here we define the Euclidean distance function 
to the point p as d p (x) := d{p, x) = \ \x — p\\. Now consider d p restricted to X and 
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X a . Although X could be approximated via a retract from X Q , it is insufficient 
to guarantee that we could well-approximate the persistence module of d p \ x 
through that of d p \ x . In other words, in Fig. 16 the persistence diagram of 



these persistence modules could differ by at leastd. 




Fig. 16: The space X, its a-offset X Q and a point p. Consider the distance func- 
tion dp to p. This is an example where the offset X Q and space X are homotopy 



equivalent but the persistence diagrams of the functions d p \ 
potentially far apart. 



and cL 



are 



The problem of approximating a sublevel set filtration of a function on a space 
has been studied before. The setting is closely related to the results of [20]. In 
[20] . there is an approximation guarantee between a sublevel set filtration of 
a c-Lipschitz function on a space and an image persistence filtration on two 
nested Vietoris-Rips complexes with an appropriately chosen parameter. There 
are numerous requirements to apply such results, which we outline here. 

The first requirement is that we have access to geodesic distances or some 
provable approximation of it. While the geodesic distance can be inferred from 
the Euclidean distance in certain cases, this can be a difficult problem depending 
on how our space is embedded. The second requirement is that the space has 
positive convexity radius. While this is generally a safe assumption for mani- 
folds; for spaces where local homology yields interesting information, such as 
stratified spaces, this measure can often be zero (i.e. a cone has zero convexity 
radius). If, however such requirement is satisfied, we can apply the results in [20] 
directly. The resulting algorithm is to build the underlying simplicial complex 
using geodesic distances, which given a sufficiently dense sampling relative to 
the convexity radius, gives an approximation for any c-Lipschitz function. Since 
distance functions are 1-Lipschitz, the approximation results follow. 

This highlights a key obstacle in stating sampling results for function nitra- 
tions as well as an open problem we discuss below: in terms of which measures 
should we state sampling results? Is there a global geometric measure which is 
meaningful for stratified spaces? Are there weaker conditions than e-homotopy 
for approximating sublevel set behavior? As pointed out above, geometric mea- 
sures, such as reach or convexity radius can be zero even for nice spaces. It would 
be preferable to use quantifiers such as homological feature size |25| . Research in 
these directions is left for future work. 
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